RecA-ASSISTED CLONING OF DNA

Field of the Invention 

The present invention relates to the sequence-specific cloning and labeling of DNA using the RecA protein. More
specifically, the invention relates to the ability of RecA to selectively pair oligonucleotides to their homologous sequences
at the 3' recessed ends of digested duplex DNA fragments and to protect these 3' ends from enzymatic conversion to
blunt ends, thus facilitating cloning of a desired DNA fragment.

Background of the Invention 
The isolation and cloning of genomic DNA fragments is of paramount importance to the biomedical sciences.
In this regard, several methods are available to amplify DNA and to isolate selected fragments in pure form. The most
widely used amplification method is the polymerase chain reaction (PCR). In this method, oligonucleotide primers flanking
a desired DNA sequence are used to amplify the sequence by repeated rounds of denaturation, annealing and extension
steps. However, a major limitation of PCR is the small fragment size which may be reliably amplified, although recent
improvements have allowed amplification of up to 22 kilobases (kb) (Cheng et al., Proc. Natl. Acad. Sci. USA, 91:5695,
1994; Foord et al., PCR Methods and Applications, 3:S149, 1994).
Other widely used methods of cloning genomic DNA fragments involve the construction and screening of DNA
libraries, most commonly in λ phage and cosmid vectors. Other vectors are now gaining widespread use for cloning large
(> 100 kb) segments of DNA, including yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs) and
P1 phage derived artificial chromosomes (PACs). Such libraries, however, are difficult to construct and screen.

E. coli RecA protein has been used to screen libraries and to enrich for a selected DNA fragment (Rigas et al.,
Proc. Natl. Acad. Sci. USA, 83:9591, 1986; Honigberg et al., Proc. Natl. Acad. Sci. USA, 83:9586, 1986;
Taidi-Laskowski et al., Nucl. Acids Res., 16:8157, 1988; Sena et al., Nature Genet., 3:365, 1993; Jayasena et al., J. Mol.
Biol., 230:1015, 1993). These methods are based on the ability of RecA to specifically target single-stranded DNA to
complementary target duplex DNA to create a three-stranded complex (Camerini-Otero et al., Cell, 73:217, 1993), or to
pair two complementary single strands to the target duplex DNA to create a four-stranded complex. These strategies
have not been applied to practical problems in molecular biology.

RecA-Assisted Restriction Endonuclease (RARE) cleavage is a general and efficient method of targeting
restriction enzyme cleavage to unique predetermined sites and is described in U.S. Patent Application Serial No.
08/089,910, the entire contents of which are hereby incorporated by reference, and by Ferrin et al. (Nature Genet.,
6:379, 1994). This method is based on the ability of RecA to pair oligonucleotides to homologous sequences in duplex
DNA to form three-stranded complexes. These complexes protected the selected sites from enzymatic manipulation, and,
after removal of the complexes, restriction enzyme cleavage was limited to the selected unmethylated sites. This method
has been used to map and manipulate large segments of DNA (Ferrin, in Genetic Engineering: Principles and Methods,
J. Setlow, Ed., Plenum Press, New York, 17:21-30, 1995; Barton et al., Genes and Dev., 8:2453, 1994; Heineman et
al., J. Virol., 68:3317, 1994; Gourdon et al., Nucl. Acids Res., 22:4139, 1994).

Because of the practical size limitations of PCR cloning and the labor-intensive steps required in genomic DNA
library construction and screening, there is a need for a simple, efficient method of labeling and cloning large fragments
of genomic DNA. The present invention addresses this need.

Summary of the Invention 

One aspect of the present invention is a method of cloning a genomic DNA fragment containing a predetermined
DNA sequence. The method includes digesting DNA containing a predetermined DNA sequence with at least one
restriction enzyme which generates 3' recessed ends to produce DNA fragments having 3' recessed ends. The DNA
fragments are reacted with RecA protein and two oligonucleotides. These oligonucleotides are complementary to either
DNA strand of the fragment containing the predetermined DNA sequence. In a preferred embodiment, the oligonucleotides
are 30 to 60 nucleotides in length. The resulting fragments are then reacted with a DNA polymerase. As a result, all
DNA fragments except the fragment containing the predetermined DNA sequence become blunt-ended. The
oligonucleotides are dissociated from the ends of the fragment containing the predetermined DNA sequence. The DNA
fragments are then ligated to a vector having 3' recessed ends complementary to those produced by the restriction
enzyme. Only the fragment containing the predetermined DNA sequence is incorporated into the vector. The vector can
be a plasmid, such as pBC SK+ or pBS SK+. Advantageously, the vector is a yeast artificial chromosome, bacterial
artificial chromosome or P1 phage artificial chromosome. Preferably, the restriction enzyme is EcoRI or a combination
of EcoRI and BamHI. Advantageously, the DNA polymerase can be the exonuclease-deficient mutant of the Klenow
fragment of E. coli DNA polymerase I.

The method can further comprise the step of size fractionating said DNA fragments of step (a) to enrich for
the fragment containing the predetermined DNA sequence. This embodiment can further comprise, prior to the ligating
step, ligating the enriched DNA fragments to a biotinylated duplex containing complementary 3' recessed ends, wherein
the biotinylated duplex is bound to streptavidin-coated beads. In addition, the method can further comprise amplifying
the DNA fragment containing the predetermined DNA sequence. Preferably, the amplifying step comprises transfection
into bacteria or PCR.

The present invention also provides a method of diagnosing a genetic mutation in a mammal in which a variation
of the above method is used. In this method, the fragment containing the mutation is amplified and it is determined if
the mutation is present. Amplification can be by growth of the vector in a suitable microorganism or through PCR.
Determination of the presence of the mutation can be accomplished by sequencing the fragment. Preferably, the mammal
is a human and the DNA polymerase is the exonuclease-deficient mutant of the Klenow fragment of E. coli DNA
polymerase I. The method can further comprise the step of size fractionating the DNA fragments of step (a) to enrich
for the fragment containing the mutation. The method can also further comprise, prior to the ligating step, ligating the
enriched DNA fragments to a biotinylated duplex containing complementary 3' recessed ends, wherein the biotinylated
duplex is bound to streptavidin-coated beads. In addition, the method can further comprise amplifying the DNA fragment
containing the predetermined DNA sequence.

Another aspect of the present invention provides an article of manufacture which includes packaging material
and one or more reagents for cloning of DNA. The reagents for cloning of DNA include RecA, and the packaging
material includes instructions for using the reagents to clone DNA, such as by the method described above. The reagents
can also include one or more restriction enzymes capable of generating 3' recessed ends, DNA polymerase and a vector
having 3' cohesive ends. The 3' cohesive ends are preferably complementary to the 3' recessed ends generated by the
restriction enzymes. In one embodiment these restriction enzymes are EcoRI and BamHI. The DNA polymerase can be
the Klenow fragment of E. coli DNA polymerase I, and the vector can be a plasmid.

Brief Description of the Drawings 
Figure 1 is a schematic diagram of the strategy used for sequence-specific RecA-mediated amplification of DNA.
The RecA-oligonucleotide complexes are indicated by the adjacent circles having a line passing through their centers.

Figure 2 is a schematic diagram of λ DNA digested with HindIII. The 2.3 kb fragment labeled by RecA-Assisted
Cloning is indicated by the arrow. The left (L) and right (R) 3' ends to which the L and R oligonucleotides are
complementary are shown.

Figure 3 is a schematic diagram of the construct resulting from ligation of the 1.4 kb human int-2 genomic DNA
fragment to a vector and to a biotinylated DNA duplex bound to a streptavidin-coated bead. EcoRI and BamHI restriction
sites and 4 base pair 5' overhangs are shown. S, streptavidin; B, biotin; and P, 5' phosphate.

Detailed Description of the Preferred Embodiments

The present invention relates to a method of sequence-specific genomic DNA cloning. This method is based on
the ability of E. coli RecA protein to selectively pair oligonucleotides to their complementary sequences at the ends of
duplex DNA. Genomic DNA is digested with one or two restriction enzymes which produce 3' recessed ends (5'
overhangs). After addition of RecA protein and a pair of oligonucleotides, each complementary to one of the ends of
a genomic DNA fragment of interest, the resulting three-stranded complexes become resistant to elongation by DNA
polymerase and thus retain their 3' recessed ends after addition of the enzyme, while the unprotected genomic DNA
fragments are filled in by the polymerase, thus becoming blunt-ended. Because most restriction endonucleases produce
fragments having 3' recessed ends, these fragments were targeted for amplification using the method of the present
invention. By using ligation conditions and vectors which greatly favor ligation to 3' recessed ends, protected fragments
are selectively cloned. The vector into which the genomic DNA fragment is to be inserted is digested with the same
restriction enzyme(s) as was the genomic DNA, or with a restriction enzyme(s) that produce the same 5' overhangs,
resulting in complementary 3' recessed ends for insertion of the genomic DNA fragment.

The oligonucleotides used in the present method are complementary to a portion of the ends of a desired DNA
fragment, including the 5' overhangs themselves. For example, digestion with EcoRI produces the 5' overhang TTAA.
Thus, the oligonucleotides are complementary to this sequence plus additional sequence of the genomic DNA fragment
adjacent to the overhang which is complementary to the remainder of the oligonucleotide. It will be appreciated that
an oligonucleotide complementary to one DNA strand is identical in sequence to the other DNA strand. Of course, the
oligonucleotides can be complementary to either strand at the end of the DNA duplex. Further, the oligonucleotides can
also include additional nonhomologous sequences at their 5' or 3' ends, although these "nonhomologous tails" do not
ordinarily increase the efficiency of protection. In addition, it is contemplated that only a single end of a desired DNA
fragment can be protected with an oligonucleotide complementary thereto. In fact, a 200-fold enrichment of a particular
DNA sequence can be obtained by protection of one end of a DNA fragment. This entails cloning using a vector having
one cohesive and one blunt end, or cutting the DNA fragment with another enzyme after the polymerase reaction.
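
The design rule for the protecting oligonucleotides can be illustrated with a short Python sketch. It takes the top strand of a fragment, including the single-stranded overhang positions at both ends, and returns one oligonucleotide per end, each beginning with the overhang. The function names, the default length and the toy fragment are hypothetical and are used only for illustration. Overhangs in the sketch are written 5' to 3', so the EcoRI overhang appears as AATT, which is TTAA read in the opposite direction.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]

def end_oligos(span, length=33):
    """Design one protecting oligonucleotide for each end of a fragment.

    `span` is the top strand (5'->3') covering the whole fragment, from the
    first base of the left 5' overhang to the position opposite the first
    base of the right 5' overhang.
    """
    if len(span) < length:
        raise ValueError("fragment is shorter than the requested oligo length")
    left = span[:length].upper()                # same sequence as the top-strand 5' end
    right = reverse_complement(span[-length:])  # same sequence as the bottom-strand 5' end
    return left, right

# Toy EcoRI-BamHI fragment (hypothetical sequence, for illustration only).
toy = ("AATTC" + "GTCTCACTAAGACACTCCGGTTCTCTCCAAAGCCAGGC"
       + "ACGT" * 10 + "TTGCCAAGGGTACATGG" + "GATC")
left, right = end_oligos(toy, length=30)
print(left)
print(right)
```

Each returned oligonucleotide is identical in sequence to one strand at its end and therefore complementary to the other strand, which is the relationship noted in the paragraph above.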

The present invention has a number of important applications, including fast DNA cloning, DNA amplification
in bacteria or by PCR, DNA sequence-based diagnostic tests and automated high-throughput DNA sequencing. The
reagents for performing the method can also be supplied as a diagnostic kit for identification of mutations in a particular
gene sequence. Genomic DNA can be digested with any single restriction enzyme which produces 3' recessed ends or
any two different enzymes which produce 3' recessed ends. Such restriction enzymes are well known in the art and
include, for example, BamHI, EcoRI, HindIII, HinfI, HpaII, MluI and XbaI. In the preferred embodiment, oligonucleotides
30 to 60 bases in length are used, each having complete complementarity to the ends of the desired DNA fragment.
Shorter and longer oligonucleotides can also be used, although reduced efficiency can sometimes result. A DNA
polymerase and the four deoxynucleoside triphosphates are then added and allowed to fill in all available single-stranded
sites with the exception of the fragment protected by the RecA-oligonucleotide complexes. In a preferred embodiment,
the exonuclease-free Klenow Fragment (KF) of E. coli DNA polymerase I is used. This enzyme is efficient at creating
blunt ends, can be added in excess without degrading DNA, is blocked by RecA-oligonucleotide complexes and is easily
inactivated after completion of the reaction. The use of other DNA polymerases in the present method is also
contemplated.

The RecA and KF enzymes are then inactivated, causing the RecA/oligonucleotide complex to dissociate from
the DNA duplex. This inactivation can be accomplished by, for example, treatment with sodium dodecyl sulfate or
phenol/chloroform extraction. The vast majority of the resulting DNA fragments in the mixture contain blunt ends as a
result of the action of KF. However, the DNA fragment containing the predetermined DNA sequence of interest retains
its 3' recessed ends as a result of the protection afforded by the RecA/oligonucleotide complex. These fragments can
then be easily ligated into a vector having complementary 3' recessed ends.
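
The selection logic described above can be summarized in a small Python sketch. The `Fragment` class, its field names and the example fragments are hypothetical; the sketch only tracks which ends survive the polymerase step and is not a model of reaction kinetics or efficiency.

```python
from dataclasses import dataclass

@dataclass
class Fragment:
    name: str
    left_protected: bool   # RecA-oligonucleotide complex bound at the left end
    right_protected: bool  # RecA-oligonucleotide complex bound at the right end

def ends_after_fill_in(frag):
    """Return the state of the (left, right) ends after the polymerase step."""
    def state(protected):
        return "3' recessed" if protected else "blunt"
    return state(frag.left_protected), state(frag.right_protected)

def ligates_to_cohesive_vector(frag):
    """True if both ends can still pair with a vector carrying matching 3' recessed ends."""
    return ends_after_fill_in(frag) == ("3' recessed", "3' recessed")

# A digest in which only the target fragment carries protecting complexes.
digest = [
    Fragment("target", True, True),
    Fragment("neighbor A", False, False),
    Fragment("neighbor B", False, False),
]
selected = [f.name for f in digest if ligates_to_cohesive_vector(f)]
print(selected)  # ['target']
```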

After enrichment for a particular fragment, the fragment is ligated to a vector containing the appropriate 3'
recessed ends. The insert is then amplified by, for example, transforming bacteria with the insert-containing plasmid or
by PCR. Because DNA fragments having complementary 3' recessed ends are ligated to the vector much more readily
than DNA fragments containing blunt ends, the resulting clones are highly enriched for the selected fragment.

In a preferred embodiment, in the cloning of genomic DNA fragments, it is desirable to size fractionate the
digested DNA prior to the RecA/KF protection reaction to augment the enrichment of a particular fragment and to
eliminate the cloning of small fragments. This can be accomplished by, for example, using an agarose gel followed by
recovery of DNA from the relevant molecular weight region of the gel. In another preferred embodiment, the size
fractionated DNA is ligated to both a short biotinylated duplex bound to streptavidin-coated beads which terminates with
the same cohesive end as that produced by digestion with the restriction enzyme(s) used to digest the genomic DNA,
and to a vector which has been digested with a restriction enzyme which produces 3' recessed ends complementary to
those produced after digestion of the genomic DNA. Alternatively, the ligation reaction can be performed in two steps.
In a preferred embodiment, the vector is a plasmid. Other vectors are also contemplated, including bacteriophage vectors
such as λ; eukaryotic expression vectors such as the LacSwitch™ inducible mammalian expression system (Stratagene),
adenoviral vectors and the like. Particularly preferred vectors for the propagation of large DNA fragments include YACs,
BACs and PACs. The vector is then used to transform cells which are expanded, resulting in amplification of the
selected DNA fragment. Alternatively, the fragment can be amplified using PCR.

RecA-Assisted Cloning has sufficient specificity to allow cloning directly from genomic DNA and is a much easier
alternative than construction and screening of DNA libraries. The technique is preferable to PCR in the cloning of large
(greater than about 8 kb) or highly repetitive fragments of DNA, especially if absolute fidelity is required, because
RecA-Assisted Cloning has a lower error rate than PCR.

RecA-Assisted Cloning can be used to identify specific mutations in a gene which give rise to genetic
abnormalities and thus is useful in screening patients for genetic abnormalities or mutations which will predispose patients
to certain conditions. Such mutations include point mutations, insertions and deletions. One particular use in this regard
is in fetal screening. Fetal cells can be obtained by amniocentesis and analyzed for genetic defects including Tay-Sachs,
sickle cell anemia, β-thalassemias, and any other desired genetic disease. Specific oligonucleotides are designed which
will hybridize to the 3' ends of the fragment containing the DNA sequence of interest.

Many modifications of RecA-Assisted Cloning are contemplated. For example, the RecA/KF reactions worked
well on DNA embedded in agarose, which will be useful for molecules that would tend to shear in solution. For
applications in which increased specificity is desired, RecA-Assisted Cloning can be used after RARE cleavage, or with
type IIs restriction enzymes that create varied and asymmetric staggered ends unrelated to their recognition sites (Berger,
Anal. Biochem., 222:1, 1994). Increases in specificity would also be useful for labeling specific genomic DNA fragments
using RecA-Assisted Cloning, which is a viable alternative to detection methods such as Southern blotting. In addition, if
conditions can be found that allow labeling of very short duplexes, the method can be a useful adjunct to sequencing
by oligonucleotide array methods (Drmanac et al., Science, 260:1649, 1993).

Sequence-specific labeling of a λ DNA fragment using RecA-Assisted Cloning was performed as described below.

Example 1 

Sequence-specific labeling of a 2.3 kb DNA fragment

E. coli RecA protein was prepared as described (Ferrin et al., Science, 254:1494, 1991) using an overproducing
strain provided by Barbara McGrath of the Brookhaven National Laboratory, or purchased from Boehringer Mannheim
(Indianapolis, IN). The sequence of the L oligonucleotide was 5'-gattatAGCTTTTCTAATTTAACCTTTIjTCAGGTTACCA-3'
(SEQ ID NO:1), and the R oligonucleotide was 5'-gattatAGCTTGTGTGCCACCCACTACGACCTGCATAA-3' (SEQ ID NO:2).
The lower case letters indicate sequences of nonhomologous tails, and the capital letters indicate the sequences of
portions homologous to the ends of the λ fragment. Oligonucleotides over 30 bases in length were purified on acrylamide
gels and concentrations were measured as described (Ferrin et al., supra).

The RecA protection reaction volume was 100 µl and contained 25 mM Tris-acetate, pH 7.85, 4 mM
magnesium acetate, 0.4 mM dithiothreitol, 0.5 mM spermidine, 1.1 mM ADP, 0.3 mM ATP-γ-S (Fluka), 13 µg of RecA
protein, 0.32 µg L or R oligonucleotide (or 0.16 µg each of L and R), Z5 µg of HindIII-digested λ DNA (New England
Biolabs, Beverly, MA) and 40 µg bovine serum albumin (BSA; Sigma, St. Louis, MO), 38 µM each of dATP, dCTP, dGTP
and TTP, and 12.5 units of KF (United States Biochemical, Cleveland, OH). After a 10 minute incubation at 37°C, KF
and deoxynucleoside triphosphates were added and the reaction allowed to proceed for 30 minutes at 37°C. RecA and
KF were then inactivated by extraction with phenol/chloroform (1:1), followed by extraction three times with diethyl ether,
addition of sodium acetate to 0.3 M and precipitation with ethanol. The pellets were washed with 70% ethanol followed
by ligation to the following short radioactive duplex for one hour at room temperature:

5'-AGCTTACGATCGATGCCTTGACAT-3' (SEQ ID NO:3)
    3'-ATGCTAGCTACGGAACTGTAGGAG-5' (SEQ ID NO:4)

The HindIII cohesive end is at the left, and the bottom strand was labeled with γ-32P-ATP using polynucleotide
kinase. The kinase was heat-inactivated at 65°C for 10 minutes and the unreacted γ-32P-ATP was removed by gel
filtration (Chroma Spin + TE-10 columns; Clontech, Palo Alto, CA) prior to adding the top strand. The ligation reaction
had a volume of 40 µl and contained 1.0 µg λ DNA, 0.8 µg labeled duplex, 8 units of E. coli DNA ligase, and the buffer
recommended by New England Biolabs without BSA. Excess duplex was removed by gel filtration followed by addition
of bromphenol blue and glycerol. The samples were heated to 65°C for 3 minutes and analyzed by agarose gel
electrophoresis. Quantitation was performed using a Fuji Phosphor Imager and yields were calculated by comparison to
the 2.3 kb band obtained from the reaction mixture containing the L oligonucleotide, but lacking KF, after a small
correction for a portion of the band removed by ligation to other fragments.
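
The base pairing of the short labeled duplex can be checked with a few lines of Python. The sketch assumes the two strands read 5'-AGCTTACGATCGATGCCTTGACAT-3' (top) and 3'-ATGCTAGCTACGGAACTGTAGGAG-5' (bottom), and that the first four bases of the top strand form the single-stranded HindIII cohesive end. The helper name `pairs_after_overhang` is hypothetical.

```python
PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}

top = "AGCTTACGATCGATGCCTTGACAT"          # written 5'->3'
bottom_3to5 = "ATGCTAGCTACGGAACTGTAGGAG"  # written 3'->5', as in the text

def pairs_after_overhang(top, bottom_3to5, overhang_len):
    """Check that the top strand past its 5' overhang pairs base by base
    with the start of the bottom strand (read 3'->5')."""
    stem = top[overhang_len:]
    if len(bottom_3to5) < len(stem):
        return False
    return all(PAIR[a] == b for a, b in zip(stem, bottom_3to5))

overhang = top[:4]                                  # single-stranded cohesive end
extension = bottom_3to5[len(top) - len(overhang):]  # bottom-strand bases past the top 3' end

print(pairs_after_overhang(top, bottom_3to5, 4))
print(overhang, extension)
```

Under these assumptions the duplex has a 20 base pair stem, a four-base AGCT cohesive end at the left and a four-base single-stranded extension of the bottom strand at the right.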

Efficient labeling of only the 2.3 kb band occurred when the HindIII-digested λ DNA was incubated with the
L oligonucleotide, the R oligonucleotide and KF. In this case, the L and R oligonucleotides were used to protect both
the left and the right ends of the 2.3 kb λ DNA fragment followed by ligation of the short labeled duplex to both ends.
When only the L or R oligonucleotide was used, each band on the agarose gel was only about half the intensity of the
band obtained using both oligonucleotides. No specific labeling was observed if the ends were not protected (neither
L nor R present), or when the restriction enzyme used to fragment the starting λ DNA produced blunt ends. In
addition, all of the fragments were labeled when KF was omitted.

The protection efficiency at each end of the 2.3 kb fragment was about 90%. Nonspecific protection of other
ends was detectable, but less than 0.5%, and labeling of the DNA with blunt ends was undetectable. Only 29 bases
of sequence information at each end of the duplex was used in designing the oligonucleotides (33 bases if the 4 base
single-stranded tail produced by HindIII is counted). A series of nine oligonucleotides was synthesized using an
automated DNA synthesizer to investigate the parameters that determine protection efficiency. The efficiency was the
same when the oligonucleotide contained 41 homologous bases, but dropped to 76% with 19 bases, and to less than
1% with 10 bases.
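
As a rough illustration of what these per-end efficiencies imply for a fragment that must be protected at both ends, the following Python sketch multiplies the two end efficiencies. It assumes that the two ends are protected independently, which is an assumption of the sketch and is not stated in the text; the value 0.01 stands in as an upper bound for "less than 1%".

```python
# Approximate protection efficiency per end, keyed by number of homologous bases.
# Values are taken from the paragraph above; 0.01 is an upper bound.
efficiency = {41: 0.90, 29: 0.90, 19: 0.76, 10: 0.01}

def both_ends_protected(p_end):
    """Probability that both ends are protected, assuming independent ends."""
    return p_end * p_end

for bases, p in sorted(efficiency.items(), reverse=True):
    print(f"{bases} homologous bases: one end {p:.2f}, both ends {both_ends_protected(p):.4f}")
```

Under this assumption, a 90% efficiency per end corresponds to about 81% of fragments retaining both 3' recessed ends.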

The oligonucleotides could have the same sequence as either strand at the end of the duplex without changing
the efficiency. Addition of a tail that extended the oligonucleotide past the end of the fragment did not change the
efficiency. These results were slightly more favorable than with RARE cleavage, and probably reflected the increased
stability of complexes formed at the end of duplexes (Kim et al., J. Mol. Biol., 247:874, 1995).

To demonstrate RecA-Assisted Cloning using genomic DNA, we cloned a 1.4 kb EcoRI-BamHI fragment of the
human int-2 proto-oncogene as described in the following example.

Example 2

RecA-assisted cloning of genomic DNA
The human int-2 proto-oncogene has been mapped and sequenced (Casey et al., Mol. Cell. Biol., 6:502, 1986;
Brookes et al., Oncogene, 4:429, 1989). In this gene, one EcoRI site lies just upstream of exon 2 and, in about half
of the alleles, a BamHI site is 6.9 kb downstream of the EcoRI site. Human genomic DNA was isolated from multiple
placentas (Sigma), digested with EcoRI and BamHI, extracted with phenol/chloroform and ethanol precipitated as described
in Example 1. Yields after the ethanol precipitation were typically about 60%. Digested DNA was size fractionated on
a 0.8% SeaPlaque GTG (FMC BioProducts) agarose gel in TAE buffer (Sambrook et al., Molecular Cloning: a Laboratory
Manual, Cold Spring Harbor Laboratory Press, Plainview, NY, Second Edition, 1989). Multiple wells were loaded with
150-200 µg DNA per well. The gels were run until the 1.4 kb fragment had migrated 4 to 6 cm. The marker lane was
then removed, stained with ethidium bromide and used as a guide to excise 0.5 cm above and below the position of the
1.4 kb fragment. Size fractionation resulted in a modest sequence enrichment (about ten-fold), a ten-fold decrease in
the amount of reagents required, and the elimination of small fragments that would preferentially be represented in the
final clones.

DNA was extracted from the excised gel using GELase (Epicentre) according to the manufacturer's directions;
complete digestion was required for good yields. The yield of the complete protocol was 2 to 4%. Comparable yields
were obtained with a silica gel extraction kit (Qiagen) or by electroelution (Pun et al., Prep. Biochem., 20:123, 1990).
The size of the extracted DNA from the heavily overloaded gels was checked on analytical gels. Depending on the
amount available, DNA was quantified by absorbance, fluorescence (Labarca et al., Anal. Biochem., 102:344, 1980), or
spotting in an ethidium bromide solution (Sambrook et al., 1989).

The size-fractionated doubly digested human placental DNA was used as the starting DNA for the RecA
protein/KF protection reaction. The conditions for this reaction were the same as described in Example 1 with the
exception that the total volume was 1440 µl and contained 3.2 µg of each oligonucleotide, 360 µg RecA protein, 2.6 µg
fractionated DNA, 570 µg BSA and 450 units of KF. One oligonucleotide was identical to the int-2 genomic sequence from
2290-2347: 5'-GGTCCGAGTGCGCGGAATTCGTCTCACTAAGACACTCCGGTTCTCTCCAAAGCCAGGC-3' (SEQ ID NO:5), and
the other was complementary to 3621-3677:
5'-TGGTCCTAGCTTGGATCCCATGTACCCTTGGCAAAGCATTCTACTGCCCACATCCCC-3' (SEQ ID NO:6). EcoRI and BamHI
cleave 3' of residues 2304 and 3660, respectively (Casey et al., 1986; Brookes et al., 1989).
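
The coordinates quoted for the two int-2 oligonucleotides can be cross-checked against the sequences themselves. The Python sketch below assumes that coordinates are 1-based and inclusive and uses the oligonucleotide sequences of SEQ ID NO:5 and SEQ ID NO:6; the function and variable names are hypothetical. It reports where a 57-base oligonucleotide beginning at 3621 would end, locates the EcoRI and BamHI recognition sites within the oligonucleotides, and computes the distance between the two cleavage positions.

```python
oligo_5 = "GGTCCGAGTGCGCGGAATTCGTCTCACTAAGACACTCCGGTTCTCTCCAAAGCCAGGC"  # identical to top strand
oligo_6 = "TGGTCCTAGCTTGGATCCCATGTACCCTTGGCAAAGCATTCTACTGCCCACATCCCC"   # complementary to top strand

def span_end(start, seq):
    """Last genomic position covered by `seq` when its span begins at `start` (inclusive)."""
    return start + len(seq) - 1

end_5 = span_end(2290, oligo_5)
end_6 = span_end(3621, oligo_6)

# EcoRI (GAATTC) cleaves after the first G; its position on the top strand:
ecori_g = 2290 + oligo_5.find("GAATTC")

# oligo_6 is the reverse complement of the top strand, so index i pairs with
# position end_6 - i. GGATCC is palindromic; the top-strand G of G^GATCC is
# opposite the last base of the site as written in the oligo.
bamhi_g = end_6 - (oligo_6.find("GGATCC") + 5)

print(len(oligo_5), end_5)   # 58 2347
print(len(oligo_6), end_6)   # 57 3677
print(ecori_g, bamhi_g)      # 2304 3660
print(bamhi_g - ecori_g)     # 1356
```

The distance of 1356 base pairs between the two cleavage positions agrees with the 1.4 kb size given for the fragment.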

The protected fragments were ligated both to the pBS vector (Stratagene, La Jolla, CA) and to DNA bound
to streptavidin beads using T4 DNA ligase. This step reduces the number of clones containing only vector DNA. When
plasmid or λ vectors were simply ligated to DNA from the RecA protein/KF reaction, the vast majority of clones did not
contain an insert. Efforts were made to reduce this background by decreasing the vector concentration, but this also
lowered the efficiency of the cloning procedure. Due to the low amount of the selected fragment in genomic DNA, a
large concentration of vector facilitated the intermolecular vector-fragment ligation.

Ligation of the 1.4 kb fragment to the vector and to the biotinylated DNA duplex bound to the
streptavidin-coated beads is schematically shown in Figure 3. Magnetic streptavidin beads (2 mg; Dynabeads M-280, Dynal) were
used according to the manufacturer's instructions and saturated with the following duplex that contained an EcoRI
cohesive end that lacked a 5' phosphate:

5'-AATTCTACCAGAGGTACAAGGAGGA-3' (SEQ ID NO:7)
    3'-GATGGTCTCCATGTTCCTCCTA-5' (SEQ ID NO:8)

The oligonucleotide shown in SEQ ID NO:8 was synthesized with a biotin group at the 5' end using the LC
Biotin-ON phosphoramidite (Clontech). After binding, excess duplex was removed by washing the beads with 1 M NaCl,
50 mM Tris-HCl, pH 7.5, followed by T4 DNA ligase buffer (New England Biolabs). Vector was prepared by treating
pBS SK+ with EcoRI, BamHI and calf intestinal phosphatase. The small polylinker fragment arising from the digestion
was removed by gel filtration. The ligation reaction contained the washed beads and 80 µl of T4 DNA ligase buffer
with 20% of the DNA from the RecA protein/KF protection reaction, 0.34 µg of vector and 3,200 units of T4 DNA
ligase. After 16 hours at 16°C, unligated DNA and vector were removed by washing the beads. The oligonucleotide
shown in SEQ ID NO:7 (0.3 µg) was added to replace any removed by washing. The BamHI site on the other side of
the fragment was available to ligate to the vector.

Immobilized fragment-vector DNA was removed by treatment with 80 units of EcoRI in 100 µl EcoRI buffer.
The solution containing the fragment-vector DNA was removed from the beads, extracted and ethanol precipitated as
described in Example 1, except that 20 µg glycogen was added before the ethanol. To circularize the fragment-vector
molecules, the DNA was treated with 1,600 units of T4 DNA ligase in 100 µl of T4 ligase buffer for 16 hours at 16°C.
The DNA was concentrated by ethanol precipitation and used to transform 50 µl of E. coli XL1-Blue MRF' (Stratagene)
by electroporation in 0.1 cm cuvettes and a Gene Pulser apparatus (Bio-Rad, Richmond, CA). The cells were prepared
for electroporation according to the instructions provided by Bio-Rad and yielded 8x10' colonies per µg of plasmid DNA
using standard 0.2 cm cuvettes. Cells were plated on Luria Broth (LB) agar containing ampicillin, tetracycline,
isopropylthio-β-D-galactoside (IPTG) and X-gal (Sambrook et al., 1989). Plasmid DNA was obtained by scraping and
washing the resulting 7,000 white and 500 blue colonies from the plates with LB. Plasmid DNA was prepared using
a kit (Qiagen) and extracted with cetyltrimethylammonium bromide (Ausubel, F., Ed., Current Protocols in Molecular
Biology, Wiley, New York, 1995) to remove enzyme inhibitors.

Both the starting human placental DNA and the DNA obtained by RecA-Assisted Cloning were digested with
EcoRI and BamHI and analyzed by agarose gel electrophoresis and Southern blotting. After staining with ethidium
bromide, a large smear was observed in the lane containing digested human placental DNA, while only vector DNA was
visible in the lane containing the DNA obtained by RecA-Assisted Cloning, although a faint smear of insert DNA centered
at around 1.4 kb was observed on another gel containing twice as much DNA. The agarose gel was blotted onto
charged nylon membranes using standard techniques, and the nylon membrane was probed with a labeled fragment (SS6)
containing 0.6 kb of the selected int-2 fragment (Casey et al., 1986; Brookes et al., 1989).

The amount of the int-2 fragment was 20 times greater in the lane containing the RecA-cloned DNA as
compared to the lane containing digested genomic DNA, even though the genomic DNA lane contained 80 times more
DNA than the cloned DNA lane. Thus, a 1600-fold enrichment of the fragment was obtained by RecA-Assisted Cloning.
These results reflected cloning of a fragment present at only about one copy per diploid human genome.
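
The enrichment figure follows from multiplying the two ratios quoted above, as the following minimal Python sketch shows. The function and parameter names are hypothetical.

```python
def fold_enrichment(signal_ratio, loading_ratio):
    """Enrichment per unit of DNA: the ratio of hybridization signal
    (cloned lane / genomic lane) multiplied by the ratio of DNA loaded
    (genomic lane / cloned lane)."""
    return signal_ratio * loading_ratio

# 20-fold more int-2 signal from 80-fold less DNA.
print(fold_enrichment(20, 80))  # 1600
```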
Example 3

Multiple Cloning Trials

Multiple cloning trials were performed using both yeast and human DNA. With human DNA, the typical
enrichment was 1,000- to 2,000-fold and one int-2 clone was present for every 2,000 to 4,000 colonies. At least one
int-2 clone was obtained for every 70 µg of starting genomic DNA. When the pooled DNA after one round of RecA-
Assisted Cloning was subjected to the procedure a second time, 24% of the colonies contained the int-2 fragment. The
procedure was essentially identical to that described in Example 2, except that the RecA/KF reaction step was scaled
down by a factor of 14. pBC SK+ was used as the vector and clones were selected on chloramphenicol plates to
eliminate any background from the previous plasmid vector, which contained an ampicillin resistance gene. This
demonstrated an additional 500-fold enrichment and showed that incorrect clones arose mainly through a stochastic
process, and not through a biased selection based on partial homology to the int-2 sequence. A 1.2 kb EcoRI-BamHI
yeast genomic DNA fragment containing the proximal portion of the RAD51 gene (Shinohara et al., Cell, 69:457, 1992)
was also cloned. The oligonucleotides used to clone this fragment had sequences complementary to positions 148:
5'-TGAAAATATTGAACAGTGAATAAAGCATAAAAAAAAAATGTCGGATCCATAGCGCTAT-3' (SEQ ID NO:9), and 1164-1204:
5'-GGACTTACCTGTCCTGTCCTGAATTCACCGAAAAGCTCAGTAATAGAACCAGTTTCCACACC-3' (SEQ ID NO:10). The yeast
genomic DNA was isolated as described by Ausubel (ibid. (p.13)). Conditions were identical to those described in Example
2, except that the RecA/KF reaction was scaled down by a factor of 14, pBC SK+ was used as the vector and clones
were selected on chloramphenicol plates.
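As a quick consistency check on the RAD51 oligonucleotides, the sketch below searches each one for the EcoRI (GAATTC) and BamHI (GGATCC) recognition sequences. The oligonucleotide sequences are copied from the Sequence Listing entries for SEQ ID NO:9 and SEQ ID NO:10; the helper name `find_sites` is hypothetical. The check reports only where a recognition sequence occurs in the oligonucleotide as written, and it assumes nothing about how the oligonucleotides pair with the fragment ends.

```python
# Recognition sequences for the two enzymes used to cut the fragment.
SITES = {"EcoRI": "GAATTC", "BamHI": "GGATCC"}

# Sequences as given in the Sequence Listing (10-base groups joined).
OLIGOS = {
    "SEQ ID NO:9": "".join(
        "TGAAAATATT GAACAGTGAA TAAAGCATAA AAAAAAAATG TCGGATCCAT AGCGCTAT".split()
    ),
    "SEQ ID NO:10": "".join(
        "GGACTTACCT GTCCTGAATT CACCGAAAAG CTCAGTAATA GAACCAGTTT CCACACC".split()
    ),
}


def find_sites(seq, sites=SITES):
    """Return {enzyme: [0-based start positions]} for each site found in seq."""
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            start = seq.find(motif, start + 1)
        if positions:
            hits[enzyme] = positions
    return hits


for name, seq in OLIGOS.items():
    # Print the listing name, the oligonucleotide length, and any sites found.
    print(name, len(seq), find_sites(seq))
```

Both recognition sequences are palindromic, so searching one strand is sufficient for this check.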

Plasmid DNA from 10 int-2 clones and 10 RAD51 clones was analyzed by restriction enzyme mapping. No
rearrangements were detected. The sequences of the two vector-insert junctions and about 400 bases of insert DNA
of each of the 20 clones were determined. Plasmid DNA was prepared using a kit (Qiagen). Digestions were performed
with EcoRI and BamHI and analyzed by electrophoresis on an agarose gel. Sequencing was performed on an Applied
Biosystems model 373 sequencer using their PRISM DyeDeoxy Terminator Cycle Sequencing kit and the M13-20 and
reverse primers. No clear deviations from the published sequences were detected, but as ambiguities in the sequences
occurred at a rate of about 1%, this could only be used to set an upper limit to the error rate of RecA-Assisted Cloning.
One might expect the error rate to be closer to the in vivo error rate in E. coli of 10^-10 mutations/bp/chromosome
duplication (Schaaper et al., J. Biol. Chem., 268:23762, 1993; Drake, Proc. Natl. Acad. Sci. USA, 88:7160, 1991), rather
than the PCR error rate of about 10^ (Barnes, Proc. Natl. Acad. Sci. USA, 91:2216, 1994).

Although the invention has been described with reference to particular preferred embodiments, the scope of the
invention is defined by the appended claims and should be construed to include reasonable equivalents.
SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: The Government of the United States, as represented 
by the Secretary, Department of Health and Human Services

(ii) TITLE OF INVENTION: RecA-Assisted Cloning of Genomic DNA

(iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Knobbe, Martens, Olson & Bear 

(B) STREET: 620 Newport Center Drive, Sixteenth Floor 

(C) CITY: Newport Beach 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92660 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Altman, Daniel E.

(B) REGISTRATION NUMBER: 34,115 

(C) REFERENCE/DOCKET NUMBER: NIH117.001PR 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (714) 760-0404 

(B) TELEFAX: (714) 760-9502 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GATTATAGCT T1;TCTAATTT AACCTTTTGTC AGGTTACCA 39 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 39 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
GATTATAGCT TTGTGTGCCA CCCACTACGA CCTGCATAA 39 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
AGCCTACGAT CGATGCCTTG ACAT 24 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
ATGCTAGCTA CGGAACTGTA GGAG 24 
(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
GGTCCOAGTG CGCGGAATTC GTCTCACTAA GACACTCCGG TTCTCTCCAA AGCCAGGC 58
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:
TGGTCCTAGC TTGGATCCCA TGTACCCTTG GCAAAGCAIT CTACTGCCCA CATCCCC 57 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
AATTCTACCA GAGGTACAAG GAGGA 25

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GATGGTCTCC ATGTTCCTCC TA 22

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 58 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TGAAAATATT GAACAGTGAA TAAAGCATAA AAAAAAAATG TCGGATCCAT AGCGCTAT 58 
(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
GGACTTACCT GTCCTGAATT CACCGAAAAG CTCAGTAATA GAACCAGTTT CCACACC 57 
WHAT IS CLAIMED IS:

1. A method of cloning a DNA fragment containing a predetermined DNA sequence, comprising the steps
of:

(a) digesting DNA containing a predetermined DNA sequence with at least one restriction enzyme which
generates 3' recessed ends to produce DNA fragments having 3' recessed ends;

(b) reacting said DNA fragments with RecA protein and two oligonucleotides, said oligonucleotides
being complementary to either DNA strand of the fragment containing the predetermined DNA sequence;

(c) reacting the DNA fragments resulting from step (b) with a DNA polymerase, whereby all DNA
fragments except the fragment containing the predetermined DNA sequence become blunt-ended;

(d) dissociating said oligonucleotides from the ends of the fragment containing the predetermined DNA
sequence; and

(e) ligating said DNA fragments to a vector having 3' recessed ends complementary to those produced
by the restriction enzyme, whereby only the fragment containing the predetermined DNA sequence is
incorporated into said vector.

2. The method of Claim 1, wherein said oligonucleotides are between 30 and 60 bases in length.

3. The method of Claim 1, wherein said restriction enzyme is EcoRI.

4. The method of Claim 1, wherein two restriction enzymes are reacted in step (a).

5. The method of Claim 4, wherein said restriction enzymes are EcoRI and BamHI.

6. The method of Claim 1, wherein said DNA polymerase is the Klenow fragment of E. coli DNA
polymerase I.

7. The method of Claim 1, wherein said vector is a plasmid. 

8. The method of Claim 7, wherein said plasmid is pBC SK+ or pBS SK+.

9. The method of Claim 1, wherein said vector is a yeast artificial chromosome, bacterial artificial
chromosome or P1 phage derived artificial chromosome.

10. The method of Claim 1, further comprising the step of size fractionating said DNA fragments of step
(a) to enrich for the fragment containing the predetermined DNA sequence.

11. The method of Claim 10, further comprising, prior to the ligating step, ligating the enriched DNA
fragments to a biotinylated duplex containing complementary 3' recessed ends, wherein said biotinylated duplex is bound
to streptavidin-coated beads.

12. The method of Claim 1, further comprising amplifying the DNA fragment containing the predetermined
DNA sequence.

13. The method of Claim 12, wherein said amplifying step comprises transfection into bacteria.

14. The method of Claim 12, wherein said amplifying step comprises PCR.

15. A method of diagnosing a genetic mutation in a mammal, comprising the steps of:
(a) isolating genomic DNA containing said mutation from a mammal;
(b) digesting said genomic DNA with one or more restriction enzymes which generate 3' recessed ends
to produce genomic DNA fragments having 3' recessed ends;

(c) reacting said genomic DNA fragments with RecA protein and two oligonucleotides, said
oligonucleotides being complementary to the ends of the fragment containing the mutation;

(c) reacting the genomic DNA fragments resulting from step (b) with a DNA polymerase, whereby all
genomic DNA fragments except the fragment containing the mutation become blunt-ended;

(d) dissociating said oligonucleotides from the ends of the fragment containing the mutation;

(e) ligating said DNA fragments to a vector having 3' recessed ends complementary to those produced
by the restriction enzyme(s), whereby only the fragment containing the mutation is incorporated into said vector;

(f) amplifying the fragment containing the mutation; and

(g) determining whether the mutation is present in the amplified fragments.

16. The method of Claim 15, wherein step (f) comprises growth of said vector in a suitable microorganism. 

17. The method of Claim 15, wherein step (f) comprises PCR.

18. The method of Claim 15, wherein said oligonucleotides are between 30 and 60 bases in length.

19. The method of Claim 15, wherein step (g) comprises sequencing said fragment.

20. The method of Claim 15, wherein step (d) comprises treating with sodium dodecyl sulfate or 
phenol/chloroform. 

21. The method of Claim 15, wherein said mammal is a human.

22. The method of Claim 15, wherein said DNA polymerase is the exonuclease-deficient Klenow fragment
of E. coli DNA polymerase I.

23. The method of Claim 15, wherein said vector is a plasmid. 

24. The method of Claim 15, wherein said vector is a yeast artificial chromosome, bacterial artificial
chromosome or P1 phage derived artificial chromosome.

25. The method of Claim 15, further comprising the step of size fractionating said DNA fragments of step
(a) to enrich for the fragment containing the mutation.

26. The method of Claim 25, further comprising, prior to the ligating step, ligating the enriched DNA
fragments to a biotinylated duplex containing complementary cohesive ends, wherein said biotinylated duplex is bound
to streptavidin-coated beads.

27. An article of manufacture comprising packaging material and one or more reagents for cloning of
DNA, wherein the reagents for cloning of DNA comprise recA, and wherein the packaging material comprises instructions
for using the reagents to clone DNA.

28. An article of manufacture comprising packaging material and one or more reagents for cloning of
DNA, wherein the reagents for cloning of DNA comprise recA protein, and wherein the packaging material comprises 
instructions for using the reagents to clone DNA according to the method of Claim 1. 

29. An article of manufacture according to Claim 27, wherein the instructions for using the reagents
comprise instructions to identify one or more mutations in genomic DNA.

30. An article of manufacture according to Claim 27, wherein the reagents additionally comprise one or
more restriction enzymes capable of generating 3' recessed ends.

31. An article of manufacture according to Claim 27, wherein the reagents additionally comprise DNA
polymerase.

32. An article of manufacture according to Claim 27, wherein the reagents additionally comprise a vector
having 3' cohesive ends.

33. An article of manufacture according to Claim 30, wherein the reagents additionally comprise a vector 
having 3' cohesive ends, and wherein the 3' cohesive ends are complementary to the 3' recessed ends generated by the 
restriction enzymes. 

34. An article of manufacture according to Claim 30, wherein
the restriction enzymes comprise EcoRI.

35. An article of manufacture according to Claim 34, wherein the restriction enzymes comprise EcoRI and
BamHI.

36. An article of manufacture according to Claim 31, wherein the DNA polymerase is the Klenow fragment
of E. coli DNA polymerase I.

37. An article of manufacture according to Claim 32, wherein said vector is a plasmid. 